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Introduction 

In the field of the Solar system and planetary dynamics, extensive computational experiments became useful and 
well established, standard research tools. These experiments regard direct numerical integrations of complex 
equations of motion to study the long-term orbital evolution and stability of various A^-body systems (e.g., 1 10|), 
an analysis of the resonance structure of the phase space of these systems and particular classes of solutions 
(like periodic and quasi-periodic orbits, equilibria), calculating dynamical maps (3] and cross- sections, modeling 
various types of observations of extrasolar planetary systems (radial velocities, astrometric and imaging data, 
eclipse and TOA timing, photometric transits) by quasi-global evolutionary algorithms (e.g.,0), investigating 
qualitative features of basic dynamical models in the framework of the general dynamical systems theory (e.g., 
[8 1), spacecraft trajectories optimization (e.g., |4 1), just to mention a few subjects of this rapidly developing branch 
of the dynamical astronomy. 

Usually, these experiments are equivalent to performing the same or very similar numerical operation or pro- 
cedure on a large set of initial conditions (e.g., numerical integrations, dynamical maps) or intermediate data (e.g., 
when evaluating the objective function during the optimization process). In a language of computing, such calcula- 
tions may be understood as standalone numerical tasks taking seconds, but also hours and days of single CPU-time. 
If processing of large sets of such tasks is required, one can split a given numerical experiment onto smaller parts, 
and distribute them over a computing pool (usually, CPU cluster or a network of workstations). This leads however 
to task management issues, which may be handled efficiently only by dedicated software tools. 

Nowadays, there are different task management systems available. Likely, the best example is the well known 
Condor package |[T3ll . Within such a numerical framework, the user-supplied, stand-alone executable code per- 
forming computations is distributed over a computing pool. The input and output data of each software instance 
must be then handled by the host node. It requires both an efficient, possibly unified task preparation scheme, 
check-pointing and keeping intermediate results, as well as data storage and a post-processing (assembling the 
results from computing nodes). Focusing on the dynamical astronomy, we address these issues by developing a 
new management code, called Mechanic that is - unlike Condor and similar software systems - built on the basis 
of the Message Passing Interface (MPI) |3. The MPI is highly- standardized and portable message-passing sys- 
tem widely used on a variety of parallel computers and CPU-clusters. We shortly present here some key features 
of that software (Sect. [L]), and we describe its basic usage for investigating a simple dynamical system (Section 
[Z} . Moreover, Mechanic may become a new helper tool in a wide range of applications, particularly focusing on 
processing large data sets. We applied it to study the global dynamics of the v Octantis planetary system, see our 
second paper in this volume. More details will be described elsewhere (Slonina et al., 2012, in preparation). 



1. Overview of the Mechanic framework 

Mechanic distributes tasks within the well-known task farm (TF) communication pattern, which introduces the 
master — worker relationship between nodes (CPUs) in a computing pool (Fig. [T]). The architecture of our 
software mimics the Unix system architecture in terms of the core — module scheme. The core of the Mechanic 
handles the MPI communication, basic setup, task pool configuration and assignment (Fig. [2]). A particular 
numerical problem, implemented in the user-supplied module, communicates with the core through provided 
Application Programming Interface (API), which makes it possible to manage the computations basically in any 
detail. It applies to any C-interoperable programming languages (such as C++, Fortran 2003+, Nvidia CUDA or 
OpenCL frameworks) and allows to reuse existing serial software without a need of extensive modifications. The 
Mechanic core may be installed system- wide and used under common queue control systems. By design, it should 
be possible to run it under control of any UNIX-like system (Linux and Mac OS are actively maintained) and it 
works uniformly in single- and multi-CPU environments. The framework may be used in two basic modes: the 
task farm (as the default) and a master- alone mode (computations are performed only on the master node). 

One of the key features of Mechanic is a dedicated and unified data storage model built on the top of the 
HDF5 standard 1 14 |. By default, no data management is performed on worker nodes. Instead, the user-specified 
results are sent by these nodes, received by the master node and stored in one global datafile. This helps to reduce 



the overall data processing time and to avoid check-pointing issues. The flexible HDF5 storage standard makes it 
possible to use computation results independently of the host software, thanks to numerous HDF5-oriented helper 
applications and script tools. 

The task assignment has a default structure of two-dimensional grid (Fig. [3]). The master node creates a pool 
of computational tasks. Each worker node receives a task from the pool, does the computations, returns the result 
to the master node, and takes the next task to run, if available. Coordinates of a specific task are used to define 
the initial state of single-run (Fig. |4]). For instance, to compute a dynamical map of a planetary system, we create 
a mesh grid of initial conditions, which are usually constrained through observations. Each initial condition is 
mapped then to a standalone numerical task processed by worker nodes. The grid scheme is governed through the 
API (Fig.|4]). 
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Figure 1: The task farm model. The master node creates 
and manages a task pool. Each worker node takes a task 
from the pool, returns the result to the master and takes 
the next task, if available. Computations in the pool are 
finished, when all tasks are processed. 



Figure 2: Design of Mechajiic. An external, user- 
supplied software communicates with the core through 
provided API {module). A module is a C-interoperable 
code compiled in the form of shared library. 
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module_task_process (task) { 

xstep = (task->xmax - task->xmiii) / 

task->xres ; 
ystep = (task->ymax - task->ymin) / 

task->yres ; 

X = task->xmiii + task->x*xstep ; 
y = task->ymin + task->y*ystep ; 

task->result = your_task_code(x,y) ; 



Figure 3: Default task assignment has the structure two- 
dimensional grid, which suits computation of the dynam- 
ical map. Each pixel is a standalone numerical task, han- 
dled by the worker node. 



Figure 4: A sample task preparation (C-pseudocode). 
Since all information on the task is available to the 
worker node, it may be used to define the initial state 
for each run. 



The Mechanic has been already extensively tested and used in different computing environments, such as large 
CPU-clusters, workstations or laptops. The code remains in early stages of its development, however, it provides 
basic core-functionality, and is suitable to host many computational problems of the dynamical astronomy. In 



principle and by the design, the MPI and HDF5 layers are hidden (or rather transparent) to user, hence only a very 
basic knowledge of parallel programming is necessary. Advanced topics, such as an additional communication 
between worker nodes or an implementation of different storage layout require parallel computing experience. For 
details of using and developing the Mechanic framework and its modules, we refer to the user-guide available 
through the project website 1 15 1. Sample numerical modules are available separately L16J . 



2. An example of Mechanic application: the Arnold web 

To illustrate a simple but non-trivial application of Mechanic, we consider the dynamical system investigated by 
Froeschle et al. |8|: 



COS 01 + COS 02 + COS 03+4 

where actions hjhjs ^ ^ and angles 0i , 02, 03 ^ S are canonically conjugated phase-space variables, and £ is the 
perturbation parameter. For e = the dynamics are integrable. It is well known that many qualitative models 
of the classical mechanics, dynamical astronomy, theoretical physics and dynamical systems theory have such a 
general form. It was formulated by Poincare as the very basic problem of Celestial Mechanics. According to 
the Kolmogorov-Arnold-Moser theorem (KAM, see e.g. 1 1 1), if the perturbation is small enough, certain non- 
degeneracy conditions are fulfilled, and if the unperturbed motion is sufficiently non-resonant, solutions of the 
perturbed system are close to the unperturbed one, and lie on invariant tori. On the same torus, all motions are 
quasi-periodic with the same frequencies. However, the KAM theorem does not say much on the dynamics close 
to the unperturbed tori with the resonant frequencies. In the neighbourhood of such a set, called the Arnold web, 
the dynamics may be strongly chaotic and complex. To study such solutions in detail, basically only numerical 
methods can be applied. The structure of the Arnold web can be illustrated at two-dimensional plane of actions, 
{hjh)- Each point of this plane corresponds to fundamental frequencies of the unperturbed torus lO. Resonances 
in this dynamical system are represented by straight lines. Due to infinite Fourier expansion of the perturbation, 
resonances of any order are possible. To distinguish between regular and chaotic motions in the neighborhood of 
the resonances, we apply numerical fast indicator, called the Mean Exponential Growth factor of Nearby Orbits 
(MEGNO) invented by Cincotta & Simo |2|. MEGNO converges to ~ 2 for the regular motions, decreases to 
for periodic orbits, and grows linearly with time for chaotic orbits. This behaviour can be colour-coded at high- 
resolution dynamical maps, constructed at the frequencies plane (Fig. [5]) with the help of Mechanic. 

To solve the equations of motion and variational equations of the test Hamiltonian dynamical system, we 
applied symplectic integrator SABA3 by Laskar & Robutel |9| and the tangent map scheme |11, 5|. Figure [5] 
illustrates the Arnold web calculated with very fine resolution of 2048 x 2048 pixels for two values of e, and 
relatively long integration times (10^-10^ ). Subsequent panels are for £ = 0.01 and £ = 0.04. The close-ups 
reveal stunning details of the structure of the phase-space. Calculations have been run on CPU clusters up to 2048 
cores, installed at the Poznan Supercomputer Centre (reef and chimera clusters), and took up to a few hours, 
depending on the integration time and step-size. The source code of the numerical module that computes the 
Arnold web, as well as technical details are available at the project website 1 16 |. An application to modeling of 
the radial velocity observations, and a study of the global dynamics of the v Oct planetary system are described in 
this volume lfT2ll . 
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Figure 5: The Arnold web for different values of perturbation parameter e. Subsequent panels are for parts and close-ups 
of the web for £ = 0.01 £ = 0.04. The fast indicator MEGNO has been computed at each point of these maps with the 
help of SABA3 symplectic integration scheme, based on the so called tangent map, to distinguish between regular (violet 
regions) and chaotic (yellow regions) motion. Resonances are represented and visible as the straight lines. The resolution 
is 2048 x2048 pixels (see the text for details). Step-size h and the total integration time T are labeled. 
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